USING PSTN TO CONVEY PARTICIPANT IP ADDRESSES 
FOR MULTIMEDIA CONFERENCING 



FIELD OF THE INVENTION 
5 [0001] The present invention relates to multimedia 

conferencing, and more particularly to transmitting the IP 
address of the conference master or MCU over the public switched 
telephone network. 

10 BACKGROUND OF THE INVENTION 

[0002] Multimedia conferencing, in which voice, image, data 
and video are shared among conference call participants, 
typically is conducted entirely within a packet mode of 
operation. Audio and extra-audio information is packetized at 

15 the end user station and is typically transmitted over a managed 
Internet Protocol (IP) network. A problem with this approach is 
that such a conference call is limited to participants with 
extra-audio capable user stations. Thus, a would-be participant 
having access only to a standard telephone instrument, referred 

20 to herein as a POTS (Plain Old Telephone Service) user, would 
not be able to participate in the conference in any capacity. 
Another problem with this approach is that user stations will 
often broadcast address request messages to determine the 
appropriate multipoint control unit (MCU) . This may present 

25 security problems. 

[0003] In light of these problems, it is desirable to have a 
multimedia conferencing system that will allow POTS users to 
participate at a base voice level. It is also desirable to 
transmit conference master or MCU addresses in a secure manner. 

30 

SUMMARY OF THE INVENTION 
[0004] Accordingly, it is an object of the present invention 
to describe a multimedia conferencing environment in which the 
voice portion of the conference call takes place over the public 
3 5 switched telephone system (PSTN) , and in which the IP address of 
the conference master or MCU is transmitted in-band over the 
PSTN, thus allowing extra-audio capable participants to conduct 
the extra-audio portion of the conference over an IP network. 
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[0005] The present invention is a hybrid multimedia 
conferencing system in which participants establish voice 
communications over the PSTN, and then exchange IP addresses in- 
band over the PSTN. The IP addresses are then available to 
5 establish extra-audio media streams over a packet network. 
Extra-audio capable user stations exchange IP address 
information over the voice communication channels, with 
subsequent extra-audio communications between user stations 
taking place over the packet network as unicast or multicast 
10 messages . 

[0006] For participants on user stations connected to the 
packet network, voice communications are directed over the PSTN 
via gateways. When a user station joins the conference call, it 
transmits its IP address in-band over its PSTN voice connection. 
15 A conference master responds by sending its IP address over the 
PSTN voice connection. Extra-audio connections are then 
negotiated and established between the user station and the 
conference master over the packet network via unicast messages. 
After establishment of extra-audio connections, subsequent 

2 0 communications over these channels between user stations takes 

place as unicast or multicast messages over the IP network. 
[0007] Since participants establish and maintain an audio 
teleconference over the PSTN, participants that are not extra- 
audio capable, such as POTS users, are still able to participate 
25 at the base voice level. Also, all participants may make full 
use of the teleconferencing capabilities and features of the 
PSTN service provider. 

[0008] Because extra-audio connection information is 
transmitted in-band over the PSTN connection to parties of the 

3 0 conference call and not communicated outside of the telephone 

call, security of the system is enhanced. 

DESCRIPTION OF THE DRAWINGS 
[0009] FIG. 1 shows a block diagram of a first embodiment of 
3 5 the present invention. 

[0010] FIG. 2 shows a message flow diagram of the first 
embodiment of the present invention. 
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[0011] FIG. 3 shows a block diagram of a second embodiment of 
the present invention. 

[0012] FIG. 4 shows a message flow diagram of the second 
embodiment of the present invention. 
5 [0013] FIG. 5 shows a block diagram of a user station of the 
present invention . 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0014] FIG, 1 shows a block diagram of a preferred embodiment 
10 of the present invention. User stations 1 and 6 are multimedia 
terminals capable of supporting at least voice and typically 
other types of extra-audio connections to a packet network. The 
multimedia user stations are typically PC based, and can support 
y a variety of real-time or near real-time collaborative 

.S 15 applications such as electronic whiteboarding and document 
ffl sharing. These user stations are referred to herein as extra- 

£1 audio capable. Telephone 7 is a standard telephone instrument 

that can operate over a switched circuit network, such as a 
an private network or the Public Switched Telephone Network (PSTN) . 

:\ 2 0 Telephone 7 is not extra-audio capable and is referred to herein 
1^, as a POTS user. In the preferred embodiment, IP network 8 is a 

G packet based managed Internet Protocol (IP) network such as a 

ii; LAN, WAN or MAN. IP network 8 can also be the Internet. In the 

2 preferred embodiment, PSTN 9 is the switched circuit PSTN. 

2 5 Network 9 may also be a private switched circuit network. 

Gateways 2 and 5 are network devices that perform the network 
and signaling translation required for the interworking of IP 
network 8 and PSTN 9. Gateways 2 and 5 provide for the 
interworking of networks 8 and 9 by, in particular, translating 

3 0 protocols for call setup and release and transferring 

information between the networks 8 and 9 . In the preferred 
embodiment, gateways 2 and 5 are iMerge Centrex Feature Gateways 
manufactured by AG Communication Systems, Inc. iMerge is a 
registered trademark of AG Communication Systems. Gateways 2 
3 5 and 5 can be other gateway devices that support well known 

gateway functions as well as functions required for the present 
invention. Central office (CO) switches 3 and 4 are designed 
and engineered to operate in switched circuit network PSTN 9, 
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and to operate with gateways 2 and 5. In the preferred 
embodiment, CO switches 3 and 4 are GTD-5 EAX Central Office 
Switches manufactured by AG Communication Systems, Inc. GTD-5 
is a registered trademark of GTE Corporation. 
5 [0015] User stations 1 and 6 are connected to IP network 8 
over standard TCP/IP connections. Links 10, 11 and 16, which 
connect user station 1 to gateway 2, user station 6 to gateway 5 
and user station 1 to user station 6, respectively, are logical 
links through IP network 8. These links typically comprise a 

10 wide variety of transmission equipment. Gateways 2 and 5 would 
typically be connected to IP network 8 over 10/lOObaseT Ethernet 
connections, and connected to CO switches 3 and 4 over GR-3 03 
connections 12 and 13. Connection 14 between central office 
switches 3 and 4 represents a logical connection through PSTN 9 . 

15 This connection will typically comprise a wide variety of 

switches and transmission equipment. Communications between CO 
switches 3 and 4 is well known in the art. Telephone instrument 
7 is connected to central office switch 3 over a standard 
subscriber line connection. 

20 [0016] User stations 1 and 6 will typically support a variety 
of packet based protocols allowing Voice Over IP (VOIP) 
communications over IP network 8, and also communications over 
PSTN 9 via gateways 2 and 5. These protocols include, for 
example, TCP/IP, H.323, SIP and SDP. Operation of a H.323 

25 network is described, inter alia, in standards publications 

^^H.323 Packet-Based Multimedia Communications Systems," November 
2000, and ^^H. 225.0 Call Signalling Protocols and Media Stream 
Packetization for Packet-Based Multimedia Communication 
Systems,'' November 2000, both published by the Telecommunication 

3 0 Standardization Sector of the International Telecommunication Union 

(ITU-T), and both hereby incorporated by reference. SIP and SDP are 
described in the Internet Engineering Task Force standards 
documents ''RFC 2543 - SIP: Session Initiation Protocol, March 
1999, and ''RFC 2327 - SDP: Session Description Protocol," March 

35 1999, both published by The Internet Society, and both hereby 
incorporated by reference. 

[0017] User stations 1 and 6 are also equipped with a special 
purpose IP address sharing application software (special purpose 
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software) that allows the workstations to practice the current 
invention. This address sharing software could, for example, be 
initiated at user station startup by, for example, including a 
shortcut file referencing the address sharing software in the 
5 user station Startup folder. The address sharing software could 
also be initiated when a VOIP call is initiated, or manually 
initiated by the user station operator. 

[0018] In operation of a first embodiment, user stations 1 
and 6 and POTS user 7 establish an audio conference call over IP 

10 network 8 and PSTN 9 over connections 10-15. In the preferred 
embodiment, user station 1 calls user station 6 and establishes 
a call. User station 1, using the conference call Centrex 
feature, then sends flash hook and calls user station 5 and 
establishes a call. In similar fashion, POTS user 7 is added to 

15 the conference call. Establishing the conference call may be 

accomplished using a variety of known methods, including the use 
of Centrex features or dial-in teleconference bridging services 
available from PSTN service providers, or VOIP gateway services 
to the PSTN available from internet service providers. 

2 0 [0019] After the conference call between user stations 1 and 
6 and POTS user 7 has been established, one of user stations 1 
and 6, for example user station 1, invokes the special purpose 
software. This establishes user station 1^ as the conference 
master. The special purpose software on user station 1 then 

2 5 transmits the IP address of user station 1 over the PSTN to all 

other audio conference participants. In the preferred 
embodiment, the IP address of user station 1 is transmitted in 
an in-band message in FSK format. The IP address message may 
also be transmitted in other in-band acoustic signaling protocol 

3 0 formats, for example DTMF, or in out-of-band formats, for 

example ISDN. From a practical standpoint, in-band formats and 
transmit/receive methods that are less audibly disruptive to 
conference participants are obviously preferred. The layout of 
the IP address message can be arbitrary, or it can follow 
3 5 established protocols, for example ADSI. The IP address 

message, in addition to the IP address, may contain a unique 
call identifier or registration/authorization token to associate 
all IP address messages related to this conference call or 
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transaction. The call identifier or token is included in the 
response messages to user station 1 in order to authenticate and 
to ensure that the request for connection is being made by a 
party to the voice conference, not simply a random incoming call 
5 from some other party at an inopportune time. 

[0020] All other user stations on the audio conference call 
having the special purpose software installed (User station 6 in 
the preferred embodiment illustrated in FIG. 1) receive and 
decode the IP address message from user station 1. 

10 [0021] User station 6 then negotiates multimedia capabilities 
with the conference master user station 1. The capabilities 
negotiation takes place over logical connection 16 of IP network 
8, In the preferred embodiment, the SIP protocol is used for 
the negotiation. In general, other protocols that support 

15 capabilities negotiation may be used, for example H.323. 

[0022] After capabilities negotiation has completed between 
conference master user station 1 and the other user stations on 
the audio conference call, extra-audio conferencing between user 
stations may begin. 

2 0 [0023] FIG. 2 shows a message flow diagram of the preferred 

embodiment of FIG, 1. At (a), user station 1 initiates an H.323 
call to user station 6. The call segments between user station 
1 and gateway 2, and gateway 5 and user station 6 take place 
over IP network 8. The call segments between gateway 2 and CO 
25 3, CO 3 and CO 4, and CO 4 and gateway 5 take place over the 

PSTN using the Signaling System 7 (SS7) signaling network. The 
H.323 and SS7 messages transmitted to initiate the call are well 
known . 

[0024] At (b) , user station 6 accepts the call, responds back 

3 0 to user station 1, and a voice connection over PSTN 9 and IP 

network 8 is established. The H.323 and SS7 messages 
transmitted to accept and establish the call are well known. 
[0025] At (c) , user station 1, using the Centrex conference 
feature of CO 3, initiates a second call to POTS user 7. The 
35 call segment between user station 1 and gateway 2 takes place 

over IP network 8 . The call segment between gateway 2 and CO 3 
takes place over the PSTN using the Signaling System 7 (SS7) 
signaling network. 
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[0026] At (d) , POTS user 7 answers and establishes the call 
between user station 1 and POTS user 7. 

[0027] At (e), user station 1 establishes an audio conference 
between user station 1, user station 2 and POTS user 7. 
[0028] At (f), user station 1 transmits its IP address in an 
in-band FSK format message to user station 6 and POTS user 7. 
[0029] At (g) , user station 6 negotiates multimedia 
capabilities with user station 1. The negotiation is conducted 
using SIP over IP network 8. 

[0030] At (h) , user stations 1 and 6 may establish an extra- 
audio conference over logical connection 16 of IP network 8. 
[0031] FIG. 3 shows a second embodiment of the present 
invention in which an H.323 multipoint conference unit (MCU) 17 
is used. The operation of this embodiment is very similar to 
the embodiment of FIG. 1, with the following exceptions. The IP 
address transmitted in-band from user station 1 to the other 
audio conference participants is that of MCU 17 . The 
capabilities negotiation takes place between MCU 17 and user 
stations 1 and 6. The extra-audio conference is established 
between MCU 17 and user stations 1 and 6, and is controlled by 
MCU 17. 

[0032] FIG. 4 shows a message flow diagram of the preferred 
embodiment of FIG. 3. The message flow of this embodiment is 
very similar to the message flow illustrated in FIG. 2, with the 
following exceptions. At (f ) , user station 1 transmits the IP 
address of MCU 17. At (g) . user stations 1 and 6 negotiate 
capabilities with MCU 17. At (h) , the extra-audio conference is 
established between MCU 17 and user stations 1 and 6, and is 
controlled by MCU 17. 

[0033] In the preferred embodiments of the present invention, 
the IP address transmitted in-band by user station 1 may also be 
a DHCP (Dynamic Host Protocol) or other temporary address as 
only the active address is required. Also, IP endpoints may use 
proxy servers to negotiate connections on their behalf, and 
perform media multiplexing, address resolution, etc. 
[0034] In an alternative embodiment, user stations on the 
conference call may respond to the IP address message from user 
station 1 with a PSTN in-band message containing their IP 



addresses. This allows all user stations to receive all other 
user stations' IP addresses, which may be stored and used to 
establish private sidebar sessions, separate from the main 
conference, between two or more user station participants. 
5 [0035] FIG. 5 shows a block diagram of a multimedia user 

station of the present invention. Video codec 50 encodes video 
received from a video source, such as a camera, for transmission, 
and decodes video code received from the network for output to a 
video display. Audio codec 51 encodes the audio signal from a 
10 microphone for transmission, and decodes audio code received from 
the network for output to a loudspeaker. Audio/video 
synchronization 52 operates to control jitter from the received 
audio and video streams, and to achieve lip synchronization 
5 between these streams. Data interface 53 supports data based 

•'Sl5 applications such as whiteboarding, still image transfer, file 
m exchange, database access, etc. System control 54 provides for 

S the proper operation of the user station. It provides call 

S control, capabilities exchange, signaling of commands and other 

fll messages. Network interface 55 formats the transmitted video, 

20 audio, data and control streams into messages for output to the 
Z network, and receives the video, audio, data and control streams 

O messages from the network. The special purpose software of the 

H present invention would utilize system control 54 and network 

2 interface 55. Generally, to practice the present multimedia 

25 conferencing invention, video codec 50 is not required. 

[0036] While the present invention has been shown and 
described with respect to exemplary embodiments, it will be 
understood by those skilled in the art that modifications may be 
made thereto without departing from the scope and spirit of the 
30 invention. It is intended that the scope of the invention be 
defined by the claims appended hereto and their equivalents. 
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